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Abstract 

We derive an algorithm in the spirit of Rogers [18] and Davis, Burstein [2] that leads to 
upper bounds for stochastic control problems. Our bounds complement lower biased estimates 
recently obtained in Guyon, Henry-Labordere m- We evaluate our estimates in numerical 
examples motivated from mathematical finance. 

1 Introduction 

Solving stochastic control problems, for example by approximating the Hamilton-Jacobi- 
Bellman (HJB) equation, is an important problem in applied mathematics. Classical PDE 
methods are effective tools for solving such equations in low dimensional settings, but quickly 
become computationally intractable as the dimension of the problem increases: a phenomenon 
commonly referred to as ’’the curse of dimensionality”. Probabilistic methods on the other 
hand such as Monte-Carlo simulation are less sensitive to the dimension of the problem. R was 
demonstrated in Pardoux & Peng m and Cheridito, Soner, Touzi & Victoir [3] that first and 
second backward stochastic differential equations (in short BSDE) can provide stochastic rep¬ 
resentations that may be regarded as a non-linear generalization of the classical Feynman-Kac 
formula for semi-linear and fully non-linear second order parabolic PDEs. 

The numerical implementation of such a BSDE based scheme associated to a stochastic 
control problem was first proposed in Bouchard & Touzi [2], also independently in Zhang 
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[20]. Further generalization was provided in Fahim, Touzi & Warin [H] and in Guyon & 
Henry-Labordere [10] . The algorithm in [10] requires evaluating high-dimensional conditional 
expectations, which are typically computed using parametric regression techniques. Solving 
the BSDE yields a sub-optimal estimation of the stochastic control. Performing an additional, 
independent (forward) Monte-Carlo simulation using this sub-optimal control, one obtains a 
biased estimation: a lower bound for the value of the underlying stochastic control problem. 
Choosing the right basis for the regression step is in practice a difficult task, particularly in 
high-dimensional settings. In fact, a similar situation arises for the familiar Longstaff-Schwarz 
algorithm, which also requires the computation of conditional expectations with parametric 
regressions and produces a low-biased estimate. 

As the algorithm in [10] provides a biased estimate, i.e. a lower bound it is of limited use 
in practice, unless it can be combined with a dual method that leads to a corresponding upper 
bound. Such a dual expression was obtained by Rogers [18] , building on earlier work by Davis 
and Burstein [4] . While the work of Rogers is in the discrete time setting, it applies to a general 
class of Markov processes. Previous work by Davis and Burstein [4] linking deterministic and 
stochastic control using flow decomposition techniques (see also Diehl, Friz, Gassiat [5] for 
a rough path approach to this problem) is restricted to the control of a diffusion in its drift 
term. In the present paper we are also concerned with the control of diffusion processes, but 
allow the control to act on both the drift and the volatility term in the diffusion equation. The 
basic idea underlying the dual algorithm in all these works is to replace the stochastic control 
by a pathwise deterministic family of control problems that are not necessarily adapted. The 
resulting ’’gain” of information is compensated by introducing a penalization analogous to a 
Lagrange multiplier. In contrast to [4] and [5], we do not consider continuous pathwise, i.e. 
deterministic, optimal control problems. Instead, we rely on a discretization result for the 
HJB equation due to Krylov |13 | and recover the solution of the stochastic control problem as 
the limit of deterministic control problems over a finite set of discretized controls. 

Our paper is structured as follows. In Section[2]we introduce the stochastic control problem 
and derive the dual bounds in the Markovian setting for European type payoffs. In Section 
\m we generalize our estimates to a non-Markovian setting, i.e. where the payoff has a path 
dependence. Einally, in Section [3.21 we consider a setting suitable for pricing American style 
options in a Markov setting. We evaluate the quality of the upper bounds obtained in two 
numerical examples. First, we consider the pricing of a variety of options in the uncertain 
volatility model. Based on our earlier estimates we transform the stochastic optimization 
problem into a family of suitably discretized deterministic optimizations, which we can in turn 
approximate for example using local optimization algorithms. Second, we consider a problem 
arising in credit valuation adjustment. In this example, the deterministic optimization can 
particularly efficiently be solved by deriving a recursive ODE solution to the corresponding 
Hamilton-Jacobi equations. Our algorithm complements the lower bounds derived in m by 
effectively re-using some of the quantities already computed when obtaining the lower bounds 
(cf. Remark 12.81) . 
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Duality result for European options 


2.1 Notations 

We begin by introducing some basic notations. For any fc G N let 

o'" ■-{uj : LU £ ^([O, T], E'=), ujo = 0}. 

Let d, m G N and T > 0. Define O, := 0“^, 0 := [0, T] x D and let B denote the canonical 
process on 0"“ with F = {J^t}o<t<T the filtration generated by B. Finally, denote by Pq the 
Wiener measure. 

For h > 0, consider a finite partition {ti}i of [0,T] with mesh less than h, i.e. such that 

ti+i — t’l <h for all i. For some M > 0, let A be a compact subset of 

Om := {a; G : \x\ < M}, for some fc G N, 

and be a finite /i-net of A, i.e. for all a,b £ C A, we have |a — 6| < h. We define sets: 

• A : 0 ^ : ip is F-adapted, and takes values in 

• Ah ■= £ A'■ p is constant on \t*i for i, and takes values in At^|; 

• U := : 0 —>■ R'* : is bounded and F-adapted|; 

• Bh := |/ : [0, T] ^ R*^ : / is constant on for i, and takes values in At^|. 

For the following it is important to note that Bh is a finite set of piecewise constant functions. 

We would like to emphasize that, throughout this paper, C denotes a generic constant, 
which may change from line to line. For example the reader may find 2C < C, without any 
contradiction as the left-hand side C is different from the right-hand side C. 

2.2 The Markovian case 

We consider stochastic control problems of the form: 

uo = supE‘’“[ r + (2.1) 

oi^A ^Jo -* 

where := e~ X“ is a d-dimensional controlled diffusion defined by 

f fi{t,at,X^)dt+ f o[t,at,XT)dBt, 

Jo Jo 

and the functions /r, a, /, r satisfy the following assumption. 

Assumption 2.1 The functions fi, a, f,r defined on x AxMJ^ take values inW^ 
respectively. Assume that 

• fi, a, f, r are uniformly bounded, and continuous in a; 

• fi, a, f, r are uniformly So-Holder continuous in t, for some fixed constant So G (0,1]; 

• are uniformly Lipschitz in x, and f,r are uniformly So-Holder continuous in x; 
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• g : —>■ R is continuous. 

Remark 2.2 Our assumptions match the assumptions on the continuity of the coeffieients in 
Krylov mM’ and allow us to apply his results. 

Our main result is a duality in the spirit of [4] that allows us to replace the stochastic 
control problem by a family of suitably discretized deterministic control problems. We first 
discretize the control problem through the following lemma which is a direct consequence of 
Theorem 2.3 in Krylov m- 
Define the function 

Mo := sup E*”” 

a^Ah 

Lemma 2.3 Suppose Assumvtion \2.1\ holds and g is bounded. We have for any family of 
partition of [0, T] with mesh tending to zero that 

Mo = lim Mn. (2.2) 

h^O 


R?f{t,at,Xndt + mg(X¥)]. 


Remark 2.4 Theorem 2.3 in m also gives a rate of convergence for the discretization in 
Lemma \2.3[ i.e. there exists a constant C > 0 such that 

I Mo — Mo I < Ch^, for all 0 < h < 1. 

For the following statement, we introduce: 

m'* := inf,^gM l^maxagD;, , with 

~R-g{X^) + J^ Rtf{t,at,X^)dt- Rfipt{X'^y<j{t,at,Xi)dBt. 

Remark 2.5 It is noteworthy that stochastic integrals are defined in L^-space, so it is in gen¬ 
eral meaningless to take the pathwise supremum of a family of stochastic integrals. However, 
as we mentioned before, the set T>h is of hnite elements. So there is a unique random variable 
in equal to the maximum value of the finite number of stochastic integrals, Po-a.s. 

The next theorem allows to recover the stochastic optimal control problem as a limit of 
discretized deterministic control problems. 

Theorem 2.6 Suppose Assumvtion \2. 1\ holds and g is bounded. Then we have 

Mo = lim M^. 
h —^0 

Proof We first prove that mo < lim ,, .nu'*. Recall Ug defined in (IQ) . Since , cr are 
bounded, for a.11 (p € U the process Rf Tt[X°‘yait, at, Xt)dBt is a martingale. So we have 

mS = sup . 

aSAh 


4 


Since < maxaeVh ‘1’“’'^ for all a G Ah, we have 

4 < E’‘“ [ max $“’^1. 

'■ae®h J 

The required result follows. 


To show uo > lini/i_>ou^ we construct an explicit minimizer ip*. First note that under As¬ 
sumption EU it is easy to verify that ut defined as 

M(t, a:) := sup [ f ^f{s,as,X^)ds+^g{XT)\Xt=x\, 

OL^A ^Jt 


is a viscosity solution to the Dirichlet problem of the HJB equation: 
-dtu - suptg^ {d’u -I- fit, b,x)} = 0, UT = g, 


(2.4) 


(2.5) 


where C^u := /r(t, b, x) ■ dxU -I- iTr((cr(T''')(t, b, a;)5^2,u) — r(t, b, x)u. 

We next define the mollification := u * of u, where K is a smooth function with 
compact support in ( —1, 0) x Oi (Oi is the unit ball in R"^), and (x) := £“”“^Ar(t/£^, x/e). 
Clearly, G Ch° and converges uniformly to u. As mentioned in R emark 1 2.2 1 Assump¬ 
tion ED matches the assumptions in [14], where the author proved in his Theorem 2.1 that 

u^^^ is a classical supersolution to the HJB equation (12.411 . 

Denote 

■= dxU^^\t,0Jt). 

Since G , it follows from the Ito’s formula that 

Jo 

+ f Rtv>tiXya{t,at,Xf)dBt, for all a G Po-a.s. 

Jo 

Then, by the definition of in (12.311 . we obtain 

^ A?(/(t,at,Xr)-b 44"^-br“‘4">)(t,W“))dt 

- R^u^‘^\t, X^) + for all a G Po-a.s. 

Since is a supersolution to the HJB equation (12.41) . it follows that 

4-“’'^' < J??,(g(X^)-4"^(r,X?))for all a G D;,, Po-a.s. 

By Assumption 12.II and the fact that g is bounded, 

'1?“’'^ is uniformly bounded from above. 


( 2 . 6 ) 


(2.7) 


Also, it is easy to verify that the function u is continuous and therefore uniformly continuous 
on Sl ■= [0, T] X {|a:| < L} for any L > 0 and that converges uniformly to u on Sl- In 
particular. 


(e) . 

^ Uo, 

PL (e) := max|„;|<i \g (x) - (T, x) 0, 


as £ —^ 0. 


( 2 . 8 ) 
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It follows from (ITUl) . FTfl and (1^ that 

E’‘“ [ max $“■‘^'1 = E'^“ [ max <1-“’'^'; max IXJI < il + E”*" [ max ; max \X^\ > l] 

LaGl?/i J \.a£T>f^ a^T>}^ J LaG'D/j a,^X>^ J 

< CpL (e) + + CPo [ max I > Ll, 

where C is a constant independent of L and e. Therefore 

< lim E'’’“ [ max 1 < uo + CVq \ max IX^l > L] , for any L > 0. 

£—>0 Lctex>/i J 

Further, since 

Po[max |X^| > L] < ^Po[|X^|>L] 0, as L ^ oo, 

aeT>h 

we conclude that < uq. So the required inequality follows. H 

The boundedness assumption on g may be relaxed by means of a simple cut off argument: 
Corollary 2.7 Assume that g is of polynomial growth, i.e. 

|(;(a:)| < C(l + 1*1^), for some C,p>0. 

Let M > 0 yg’^ a continuous compactly supported function that agrees with g on Om C 
and satisfies \g^\ < |g|. Let denote the approximations defined in (lO) , with respect to 

g^ in place of g. Then we have 

lim Iuq — lim = 0. 

M-s-O I h-^0 \ 

Proof Define as in 1231 by using the approximation g^, i.e. 

~ supE’"«[/ Rff{t,at,Xr)dt + Rlfg^{X:f)\. 

ot£A *-Jq j 

By Theorem 12.61 we know that = limh_»o . 

Further, we have 

|no-u^| < CsupE^«[g(X?)-(,'^(X?)l 

ocGA L j 

< CsupE’"«[|X^|^ + l;|X?| > m1. 

cga L -J 

Assume M > 1. Then we obtain 

|uo-u^| < CsupE'"«f|Jf^^r;|X?| >m1 < C sup E‘’“ f . (2.9) 

aGA L -I ocGA L M j 

Since /i, a are both bounded, we have 

EPoJix^Ip+iJ < £ p{t,at,Xf‘)dt\^+^ + \ j\{t,at,Xf‘)dBt\^+^'^ < CT. (2.10) 

It follows from (12.911 and ()2.10l) that 

lim |uo — u^ I = 0. 

The proof is completed. H 

We conclude the section with two remarks, both relevant to the numerical simulation of 
the approximation derived in Theorem 12.61 
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Remark 2.8 To approximate in our numerical examples we will as in the proof of Theorem 
\2.6i use fixed funetions ip* for the minimization. The definition (12.51) makes it clear that the 
natural choice for these minimizers are (the numerical approximations of) the function d^u. 
Note that these approximations are readily available from the numerical scheme in m that is 
used to eompute the complementary lower bounds. 

Remark 2.9 In the proof of Theorem \2.6\ we showed that uq < < uo. It therefore follows 

from Remark \2.4\ that there exists a constant C > 0 such that 

Imo — < C/i3 


for all 0 < h < 1 AT. 


3 Some extensions 


3.1 The non-Markovian case 


In our first extension we consider stochastic control problems of the form 

Uo = sup [p(XrA-)l^ 

where X°‘ is a d—dimensional diffusion defined by X°‘ := p.{t, at)dt + fg a(t, at)dBt. Note 

that in this setting p, and a only depend on a and t, but the payoff function g is path dependent. 


Remark 3.1 The arguments in this subsection are based on the ’’frozen-path” approach 
developed in Ekren, Touzi and Zhang [6]. In order to apply their approach, we have restricted 
the class of diffusions X°‘ we consider, compared to the Markovian control problem. 


Writing Pa := Po o {X°‘) we have 


Uo = sup E'*’' 
aeA 



Throughout this subsection we will impose the following regularity assumptions. 


Assumption 3.2 The functions p,a : x A ^ E (E is the respective metric space) and 

<7 : 12^* ^ R are uniformly bounded such that 

• p,(j are continuous in a; 

• p,(j are So-Holder continuous in t, for some constant do € (0,1]; 

• g IS uniformly continuous. 


Example 3.3 Arguing as in Corollary \2.'l\ we may also consider unbounded payoffs. Hence, 
possible path-dependent payoffs that fit our framework include e.g. the maximum max^gp yj oJa 
and Asian options A uJsds. 
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Let 


Ag := {to = 0, ti, to, • • • , tn = T} 

be a partition of [0, T] with mesh bounded above by e. For k < n and nk = {xi = 
0,X2,--- ,Xk) £ denote by path generated by the linear interpolation 

of the points {{ti,Xi)}o<i<k- Where no confusion arises with regards to the underlying parti¬ 
tion we will in the following drop the superscript and write rg(7rfc) in place of 
but it must be emphasized that the entire analysis in this subsection is carried out with a 
fixed but arbitrary partition Ag in mind. Define the interpolation approximation of g by 

5®(7r„) — 5(^re(7r„)^ 

and define an approximation of the value function by letting 

00 ■- supE'^“ |'g®((Bti)o<i<n)l- 
aSA t J 

The following lemma justifies the use of linear interpolation for approximating dependent 
payoff. 

Lemma 3.4 Under Assumvtion \3.1\ we have 


lim Oq = Uq. 

€—^0 

Proof Recall that g is uniformly continuous. Let p be a modulus of continuity of g. If 
necessary, we may choose p to be concave (by taking the concave envelop). Further, we define 


ws(e,T):= sup \Bs - Bt\. 

s,t<T;|s—£|<e 


Clearly, we have 

16^0 - Mol = I sup [g^ {{BtJo<i<n)] - sup E"^" [p(Bta-)] I 

' ct^A aGA ' 

sup E'’’“ [p(wfl(e, T))] < p^ sup [ws(e,r)] y 


It is proved in Theorem 1 in Fisher and Nappo that 

E””“[ws(£,r)] < c(eln^)", 

where C is a constant only dependent on the bound of p and a. Thus, 


The proof is completed. 



We next define the controlled diffusion with time-shifted coefficients by setting 

rs rs 

:= / ij,{t + r,ar)dr + / a{t + r,ar)dBr, s€[0,r —t], Po-a.s., 

Jo Jo 


and the corresponding law: 


:=Poo(X“’‘)“\ 


Further, for 1 < fc < n — 2 let 

Vk ■” ik + 1 ffc; 

and define recursively a family of stochastic control problems: 

:= sup^g^E’‘“ ((7r„_i,2;„_i + a: + , f€[0,77„_i), a; € 

61®(7rfc;t,a;) := sup^g_ 4 E'’“‘' ((tt/c, a;*, + a; + 0,0) j, t € [0, ? 7 fc), a; G R''. (3.1) 

Clearly, 0^(0; 0,0) = 9q. 

Remark 3.5 By freezing the path nt, we get the value function 9^{TTk\ •, •) of a Markovian 
stochastic control problem on the small interval [0,r;fe). This will allow us to apply the PDF 
tools which played a key role in proving the dnal form in the previous section. 

Lemma 3.6 Fix e > 0. The function 9^{'K-,t,x) is Borel-measurable in all the arguments and 
uniformly continuous in {t,x) uniformly in tt. 

Proof It follows from the uniform continuity of g and the fact that interpolation with 
respect to a partition Ae is a Lipschitz function (in this case from R"^*^ into the continuous 
functions), that is also uniformly continuous. Denote by p® a modulus of continuity of 
3 ®, chosen to be increasing and concave if necessary. For any 7r„_i,7r(j_i G given 

t G [0, pn-i], x,x' G R'*, we have 

\9^{TTn-i-,t,x) - 0®(7r(,_i;t,a;')| 

-j-t p I 

< SUpE'’’“ \\g^[{-Kn-l,Xn-l+ X + Br,^_^-t)) 

a£A LI 

< p®(|( 7 r„_i,a;) - ( 7 r(,_i,a;')|). 

Similarly, for any k < n — 1 and TTk,r'i^ G given t G [0,p»s], x,x' G R'^, we have 

\9^nk-,t,x) - 9^{-Kk;t,x')\ 

< supE’’’"*' \\9^ [{nk,Xk + X + Br,i^-t),0,0) - 9^ [{n[,Xk + x'+ Br,i^-t),0,0)W 

aeA LI IJ 

< P®(l( 7 rfe,a:) - ( 7 r^,a;')l)- ( 3 - 2 ) 


- /((7r(j_i,a;„_i + x' + B, 


Vn-l-t 


))|] 


For 0 < t^ < t^ < P/i, it follows from the dynamic programming principle (for a general theory 
on the dynamic programming principle for sublinear expectations, we refer to Nutz and Van 
Handel m) that 


9^{'Kk\t° ,x) = supE'^“’“ [6»®(7rs,;t\a; + Bp_to))l 

a^A L J 


(3.3) 


and (iTSll and dm we deduce that 





n 

< 

sup 

E”^“ 

6^ (TTfc 


a^A 


LI 

< 

sup 

E”^“ 

[p"(|Bp 


a^A 





k+F r 

< 

P ( 

sup E'’’° 

[\B, 


\ 

aeA 


(3.4) 
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Similar to (12.101) . we have the estimate: 


supE^“’"*"[|B,i_to|l = supE>‘4|X“-l"+*“|l < (3.5) 

aeA L J aeA L J 

where C is a constant only dependent on the bound of /r and a. It follows from (13.41) and (13.51) 
that 

\e^Trk-,t°,x) - e^{-Kk-,t^,x)\ < p^(c{t^ . 

Hence, combining (13.21) and (13.51) we conclude that 9^{7rk',t, x) is uniformly continuous in (t, x) 
uniformly in nt- H 

The functions 9^{nk; •) are defined as the value functions of stochastic control problems, 

and one can easily check that they are viscosity solutions to the corresponding Hamilton- 
Jacobi-Bellman equations. For fc = 1,.. ., n — 1, we define a family of PDFs by letting 

—'L^9 = 0, on [0,77ft;) 0 where 

L''6) := dt9 + snp^^j^S^fi{tk +-,9) ■ d^9 + + ■,b)dl^9)y ^ ^ 

The following proposition links the stochastic control problems with the PDF and applies, 
analogous to the Markovian case, a mollification argument. 

Proposition 3.7 There exists a function : {n,t,x) i—> E such that (0,0,0) = 9q + e 
and for all -Kk, (tt*,; •, •) is a classical supersolution to the PDE (13.61) and the boundary 
condition: 

u^^\TTk-,rik,x) = u^^'>{{'Kk,x)\Q,Q), ifk<n-T, 
u^^\TVk-,rik,x) > g‘^({nk,x)), if k = n - 1. 

Proof Define 9^’^{'Kk; ■, •) 9^('Kk', •, •) * for all TTft, £ k < n,where is a smooth 

function with compact support in ( — 1,0) x Oi (Or is the unit ball in E*^), and K^{t,x) := 
K{t/, X/S). By Lemma f3.6l 9^’^{TVk', •, •) converges uniformly to 9^{'nk\ •, •) uniformly in 
TTft;, as (5 ^ 0. Take 5 small enough so that \\9'^’^ — 0®|| < Further, Assumption 13.21 implies 
that all the shifted coefficients pitk + •, Oi ^{tk + •, •) satisfy the assumptions on the continuity 
of the coefficients in [HI, where the author proved that 

9^’^{TVk', •, •) is a classical supersolution for (13.61) . 

Note that 9^'^ (nk', ■,■) + C is still a supersolution for any constant C. So we may define a 
smooth function u^(0; •, •) := 0'^’'^(O; •, •) + Co on [0,ti] x E'* with some constant Co such that 

u"(0; 0, 0) = r (0; 0, 0) + u"(0; •, ■) > ^(0; ■, •)■ 

Similarly, we define smooth functions {-Rk’, ■■,■) ’■= on [ 0 , 77 ft;] x E"^ for 

1 < fc < n — 1 with some constants Ct,^. such that 

li^(7rft;; 0, 0) = u®(7rft;_i; 77ft;_i, Xk - Xk-i) + -, ^^(Trft;; •, •) > 9^{'Kk\-, ■) ■ 

n 

Finally, we define for TTft, £ and {t,x) £ [O,?;*,) x E'* 

/ V e , ■. Tl - k 1 

’{'Kk\t,x) := V {Rk\t,x) H-£. 
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It is now straightfoward to check that satisfies the requirements. 


The discrete framework we just developed may be linked to pathspace by means of linear 
interpolation along the partition Ae. Recall that 0 was defined to be [0, T] x 

Corollary 3.8 Define : Q ^ M. by 

:= u^^'’[{uJti)o<i<k\t— for t € [tfc, tfc+i). 

There exist adapted processes Tt such that for all a £ A 

u'^\t,X°‘) = J (^Xt+p,{t,at)ipt + ^Tr[{aa'’){t,at)rit)'j{X°')dt+j tpt{X°'ya{t,at)dBt, 

Po-a.s., and 

(^Xt + + ^Tr[{aa'^){t,at)r]t^{uj) <0, for all a £ A, {t,uj) £ <d. 

Proof By Ito’s formula, we have 

u^^\t,X°‘) = u^^\tk,X°‘)+ f ^Xs + fJ.{s,as)(ps + ^Tir[{aa'^){s,as)r]s)'^{X°‘)ds 

+ f (ps{X°‘ya{s,as)dBa, ior t £ [tk,tk+i), Po-a.s., 

with 


Xt{ijj) ■— dtu^''\{u)ty)o<i<k\ t — tk,ijJt — 

ifitioj) ;= ((a;tJo<i<fc; t - tk,ajt - uJt^) , for t € [tk,tk+i)- 

77s(w) := {{u!ti)o<i<k;t - tk,iOt- eotk), 

By the supersolution property of proved in Proposition 13.71 we have 

(^At -I- /i(t, at)<Pt + iTr((crcr'^)(t, at)rjt)'^ {uj) 

< L'‘u^^\{oJti)o<i<k;-,-){t-tk,OJt-OJty < 0 . 


The proof is completed. 


Finally, we prove an approximation analogous to Theorem 12.61 in our non-Markovian setting. 
Theorem 3.9 Suppose Assumption 1 3. g| holds. Then we have 


uo 


pT 

lim ii^, where ■.= inf [ sup IgtXx^.)— / atldStll. 

7 i —>0 (pew iaGTOk ^ JO 


Proof Arguing as in the proof of Theorem \m one can easily deduce using the Ito formula 
that uq < lim 7 ,_»ou^. 

Consider the function and let ip be the process defined in Corollary 13.81 We have 

E'^“[suj) \^g{XTp,.)- j pt{X'"y<r{t,at)dBt^'^ 

- [di^TA-) - -I- 4 ^^}] 

< E'*”” sup ((X“^)o<i<n) Ij - 1 - 00 +S' 


h , 
V < 
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For the last inequality, we use the fact that 0, 0) = + £■ Note that there are 

only finite elements in the set T>h. Therefore, by Lemma 13.41 

— lljo ( 51/ [|5(^TA-) “ )o<i<ra) |] +^ 0 +e) 

= uo- 

We conclude that < uq for all G (0,1 A T]. H 

3.2 Example of a duality result for an American option 

In this subsection we give an indication how our approach may be extended to American 
options. To this end we consider a toy model, in which the d-dimensional controlled diffusion 
X°‘ takes the particular form X°‘ := a'^dt -f a\dBt and carry out the analysis in this 
elementary setting. The stochastic control problem is now 

uo = sup E'’’“ [^(X^)], 

Q:G-4,tG7t 

where 7 t is the set of all stopping times smaller than T. Throughout this subsection we will 
make the following assumption: 

Assumption 3.10 Suppose g : R'* —>■ R to 6e bounded and uniformly continuous. 

For a € A define probability measures Pa := Poo(X“)“^, let V := {Pa : Q G A} and define 
the nonlinear expectation T[-] := suppg^ It will be convenient to use the shorthand B 

for the stochastic integral aldBs- We have 

Uo = sup f [^(Bt)]. 
tG7t 

Further, we define the dynamic version of the control problem: 

u{t,x) ~ sup £[g{x + BT)], for (f, a;) G [—1, T] x R'^. 

TSTT-t 

The following lemma shows that the function u satisfies a dynamic programming principle 
(see for example Lemma 4.1 of [3 for a proof). 

Lemma 3.11 The value function u is continuous in both arguments, and we have 
u{ti,x)= sup £^g{x BT)l{T<t 2 } T u{t 2 , XBt 2 )l{r>t2}]- 

^eTr-ti 

In particular, {u(f, Bt)}t£[o,T] is a V-supermartingale for all P G B. 

Next we apply the familiar mollification technique already employed in Section [2. 2 1 Define 
u^®^ := u* 

Lemma 3.12 {u^^\t,Bt)}t is aV-supermartingale for allf' £ V, andu^'^'^ > g^^^ ~ g * 
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Proof 


< 

< 


For any s < t <T and a; € R, we have by Lemma ld.lll 

£[u^^\t,x + Bt-s)] = £^J u{t - r,x - y + Bt- 3 )K^'^\r,y)dydr^ 

j S [u{t -r,x - y + -Bt-s)] (r, y)dydr 

J u{t — r — (t — s),x — y)K^^\r, y)dydr 

= J u{s — r,x — y)K^^\r,y)dydr = u'’^\s,x), 

where for the second inequality, we used the P-supermartingale property of {u{t, Bt)}t£io,T] 
for all P € P. This implies that for all P G P we have 

+ Bt_s)] <u^'^\s,x). 

Therefore, (t, Bt)}t is a P-supermartingale for all P G P. On the other hand, it is clear 
from the definition of u that u> g and, hence, u^®^ > H 


Again, the stochastic control problem can be discretized. For technical reasons, we assume 
here that the partitions of time satisfy the order: 

C {t? , for h>h', (3.7) 

where nn is the number of the time grids of the partition. 

Lemma 3.13 Under AssumMion \S.liK it holds 

uo = lim Uq, where Uq -.= sup E'’’“ . (3.8) 

Proof We only prove the case = 0 and 0 . = G a compact set in R, in particular, 

X°‘ = {a ■ B). The general case follows by a straightfoward generalization of the same argu¬ 
ments. Note that it is sufficient to show that uo < lim ^ Uq. Fix e > 0. There exists a® G A 
such that 

Uo < sup E‘’“[(7((a®.B).)] +£. (3.9) 

tETt 

For any h sufficiently small define a process by letting 



Clearly, is piecewise constant on each interval We introduce the filtration F := 

{Xh}h, with 

A := A, Ai) X A : i < - 1, A G A'*})- 

In particular, it follows from (irr)! that Fh C A' for h > h'. Also, denote the probability P 
on the product space 0: 

P{dt,duj) idt x Po(duj). 
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Note that for all i < n/j — 1, ^4 G T^h and h' < h we have 


[s'*' 




= E’’“ 


= E’’“ 


' .h' ^.h 


1—3 ’j + 1— i + l 


[al\J^t^]ds IA 


~ E’’“ l{[t'*.t'*_^JxA}]• 

So {ot^}h is a martingale in the filtrated probability space (0,P, F). Note that a® and are 
bounded, so it follows from the martingale convergence theorem that 

fT 

lim E*^” / (q;® — ds = 0 (3.10) 

Jo 

Further, define := h ^-nd note that we have d^ G Ah- It follows from (13.101) that 

fT 

lim E*^” / (q;® — Oil)^ds — 0. 

h^o Jo 

With p an increasing and concave modulus of continuity of g we have 

sup E*"" [g{{ci ■ B)-,)] - sup E'" [g((d'' • B)-r)] 


tGTt 


t£Tt 


< sup E””” [p(|(a® • B)t - (o'* • B)t|)1 

tGTt ^ ^ 

< E'«[p(||(a®.B)-(d'‘.B)||„o] 


= [J (“s - dj)^ds] "" ^ 

Combining (13.9F (13.111) we have 


(3.11) 


Uo < sup E'‘'° (^(d'‘■ i3)T^j + p(^E'’'“ j^y (. 

< Uo+p(e^°\^J (a® - da)^dsj - 




Letting h —>• 0 we deduce 


for all e > 0. 


Uo < lim^^gMo + £. 


We conclude the section by proving the analogous approximation result for American 
options. 

Theorem 3.14 Suvvose Assumvtion \3. iVi holds. Then we have 

Uo = lim where := inf E'’”“ [ sup |p(X“)— / ips(X“)’^asdi3s'll. 

V’SM I agX'fc,ig[0,T] I Jo I I 
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Proof We first prove that the left hand side is smaller. Recall Mq defined in (13.81) . For all 
(/3 € W, the process <fitiX°‘yatdBt is a martingale, and we have 

Uq < sup [p(X“) — f for all G W. 

a£Ah,TGTT ^ Jq 

Since for any a € Ah and t € Tr we have g(X^)—fJ ipt{X°‘y aldBt < sup^gj,^ jgjg yj — 

Jg ips{X°‘ya^dBa'^, we obtain 

Wq < [ sup Ig(Xf)— f , for all ip G W. 

Laer>h.te[o,T] 1 Jo ^ 

The required result follows by Lemma 13.131 

For the converse, recall that Bt) is a P-supermartingale for all P G P fLemma l3.12l) . 

Further, since G C^’^, we have 

+ sup < 0. 

(60,61)6/1 '' 2 J 


Hence, for all ft > 0 


Vh < 


: e'" 

sup 

L aeT)^,te[0,T] 

{gixt) 

- f d.yy{xyal 

dH.}] 


: E*‘“ 

sup 

L aeT>h,te[0,T] 

{gixt) 

-uy\xt)+uy'' 




+1^ (^dtui^yx:) 

+ y, ■ dxufyxg) + 

^Tr{al{a\y 

i")(X:)))ds}] 

: e''“ 

sup 

{g{x^) 

-g^^\xn}]+uy\ 




LaGI3;,,t6[0,T] 




have used Ito’s formula and the inequality m^®^ 

> g''^'’ proved in 

Lemma |3.12] It is 


straightforward to check that 


lim ( E””" [ sup {g{Xt) - sr'^\X“)}l + 

E-lO Y La6I3h,t6[0.T] J J 


(e) _ 


Mo- 


4 Examples 

4.1 Uncertain volatility model 

As a first example, we consider an uncertain volatility model (UVM), first considered in [T] 
and [15] . Let A C x be a compact domain such that for all (cr*, P*'’) € A, the 

matrix 

( 

is positive semi-definite, p”-’ = fp'’ G [—1,1] and p" = 1. If d = 2 an example of such a domain 
is obtained by setting 

A = ^|(^p i)'^^ } ’ 
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where 0 < ct* < ct* and —Recall the definition of A, i.e. an adapted process 
(ct, p) = Pt)o<t<T £ ^ if it takes values in A. In the UVM the stock prices follow the 
dynamics 

d{W\ W^)t = p'Ut, l<i<j<d, 

where W'’ is a 1-dimensional Brownian motion for all i < d, and {a, p) € ^ is the unknown 
volatility process and correlation. The value of the option at time t in the UVM, interpreted 
as a super-replication price under uncertain volatilities, is given by 

Ut = sup (4.1) 

For European payoffs, = g{ujT), the value u{t,x) is then the unique viscosity solution 

(under suitable conditions on g) of the nonlinear PDE: 

dtu{t,x) + H{x,D^u{t,x)) = 0, u{T,x) = g{x) (4.2) 


with the Hamiltonian 
1 

H{x,'y) = - max , for all a; € R'*, 7 € 

2 {<^\p'Ai<i,i<deA 


Second order backward stochastic differential equation (2BSDE) 

Fix constants d = ( 6 '*)i<i<d and p = Denote a new diffusion process X: 

dXi = d^XidWi, d{W\W^)t = p"^dt, l<i<j <d, 

where W’’ is 1-dimensional Brownian motion for all 1 < i < d. Consider the dynamics: 

dZt = ^tdt VtdXt^ . , 

dYt = -H{Xt,Tt)dtAZtdXt + \{dXtyTt{dXt)dt, 

where (Y, Z,V ,'E.) is a quadruple taking values in R, (the space of symmetric dx d 

matrices) and R*^ respectively. In particular, if the HJB equation (|4.2ll has a smooth solution, 
it follows from the Ito’s formula that 

Yf.= u{t,Xt), Zt~dMt,Xt), rt:=dLuit,Xt) (4.4) 

satisfy the dynamics (|431) with a certain process In Cheridito, Soner, Touzi & Victoir [3], 
the authors studied the existence and uniqueness of the quadruple (V, Z, F, H) satisfying the 
dynamics (Ira with the terminal condition Yt = gi^Xr), without assuming the existence of 
smooth solution to the HJB equation (iTl) . and they gave the name ‘2BSDE’ to this problem. 
For the readers interested in the theory of 2BSDE, we refer to [3] and Soner, Touzi & Zhang 
|19 | for more details. 
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Numerical scheme for 2BSDE 


We are interested in solving the 2BSDE numerically. In the existing literature, one may find 
several different numerical schemes for this problem (see for example [31[51[TD]). Here we recall 
the one proposed in Guyon & Henry-Labordere [10]. Introduce the partition on the 

interval [0, T], and denote Ati = ti — AWti ~ First, the diffusion X can 

be written explicitly: 

X{. = X^e~^'^ ^ , with AW^.AWt^ = pjkAU. 

Denote by Y,T the numerical approximations of H, F. In the backward scheme in [10], we set 
= g{Ytn)j a-iid then compute 

a^a'^XiX^ - (AU)-^- a^UlSjk)] 

with Ul. : = Pjk ^/^ti, and 

where Ei denotes the conditional expectation with respect to the filtration Below, we 

denote := Yq. 


Lower and upper bound for the value function 

Once r is computed, one gets a (sub-optimal) estimation of the controls {a*, p*)-. 

d 

iK’Pti) ■- argmax(^,_pjfc)^^^ ^ for 0 < i < n. 

j,k^l 

Performing a second independent (forward) Monte-Carlo simulation using this sub optimal 
control, we obtain a lower bound for the value function 63|: 

:= E[5(xf ■^*)] < uo. 

We next calculate the dual bound derived in the current paper. As mentioned in Remark 
EHl we will use the numerical approximation of d^u to serve as the minimizer p* in the dual 
form. Also, we observe from (14.411 that the process Z in the 2BSDE plays the corresponding 
role of dxU, and we can compute the numerical approximation Z of Z: 

a^Xlzi=Ei.r[%Ui;\. 


Then we define 

n 

i = l 

Using our candidate p* in the minimization, we get an upper bound 


LS 

Uo 


< Uo 


^ dual 
< Uo 


n 
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The algorithm 

Our whole algorithm can be summarized by the following four steps: 

1. Simulate A^i replications of X with a lognormal diffusion (we choose a = {a + a) /2). 

2. Apply the backward algorithm using a regression approximation. Compute Yq = 

3. Simulate N 2 independent replication of X'^ using the sub-optimal control 
Give a low-biased estimate 

4. Simulate independent increment AVKt. and maximize 9 {X^^^) — ~ 

over {a, p) £ 'Dh- In our numerical experiments, as the payoff may be non-smooth, 
we have used a direct search polytope algorithm. Then compute the average. 

Numerical experiments 

In our experiments, we take T = 1 year and for the i-th asset, Xq = 100, = 0.1, a' = 0.2 

and we use the constant mid-volatility d' = 0.15 to generate the first Ni = 2^® replication of 
X. For the second independent Monte-Carlo using our sub-optimal control, we take N 2 = 2^® 
replications of X and a time step Als = 1/400. In the backward and dual algorithms, we 
choose the time step A among {1/2,1/4,1/8,1/12}, which gives the biggest Uq^ and the 
smallest The conditional expectations at ti are computed using parametric regressions. 

The regression basis consists in some polynomial basis. The exact price is obtained by solving 
the (one or two-dimensional) HJB equation with a finite-difference scheme. 

1. 90 — 110 call spread {Xt — 90)^ — {Xt — llO)'*', basis= 5-order polynomial; 

uo® = 11.07 < = 11.20 < uo"*"* = 11.70, = 10-30 

2. Digital option lx 2 ->ioo, basis= 5-order polynomial: 

Mo® = 62.75 < M™'® = 63.33 < mo"*"* = 66.54, mo®°® = 52.03 

3. Outperformer option {X^ — X^)^ with 2 uncorrelated assets, 

L 5 iiicr —' DE -1-1 o cr ^ cius/i 1104 ^ S DE -i -i a cy 

Uq = 11.15 < Mq = 11.25 < Mq = 11.84, Mo = 11.48 

4. Outperformer option with 2 correlated assets p = —0.5 

Mo® = 13.66 < Mo°'® = 13.75 < Mo"*"* = 14.05, Mo®°® = 14.14 

5. Outperformer spread option {X^ — O.OXy)"*" — (Ay — l.lAy)^ with 2 correlated assets 

-0.5, 

Mo® = 11.11 < Mo°® = 11.41 < Mo'"'"' = 12.35, Mo®°'® = 9.94 
In examples 3.-5. the regression basis we used consists of 

(1, A\ A^ (A^)^ (A^)^ A^A^}. 

Remark 4.1 The dual bounds we have derived complement the lower bounds derived in m- 
They allow us to access the quality of the regressors used in computing the conditional expec¬ 
tations. 
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4.2 Credit value adjustment 

Our second example arises in credit valuation adjustment. We will show that for this particular 
example, we can solve the deterministic optimization problems arising in the dual algorithm 
efficiently by recursively solving ODEs. 


CVA interpretation 

Let us recall the problem of the unilateral counterparty value adjustement (see [TT] for more 
details). We have one counterparty, denoted by C, that may default and another, B, that 
cannot. We assume that B is allowed to trade dynamically in the underlying X - that is 
described by a local martingale 

dXt = a{t,Xt)dWt, with W a Brownian motion. 


under a risk-neutral measure. The default time of C is modeled by an exponential variable r 
with a intensity c, independent of W. We denote by uq the value at time 0 of B’s long position 
in a single derivative contracted by C, given that C has not defaulted so far. For simplicity, 
we assume zero rate. Assume that g^Xr) is the payoff of the derivative at maturity T, and 
that u is the derivative value just after the counterparty has defaulted. Then, we have 


Uo 


+ u(t, Xt)1{t<T}^ 

g^Xr) + j u(t,. 


Write down the dynamic version: 

u(t,x) = E^ e~‘^^‘’~^^cu{s,Xs)ds^Xt = x^. 
The function u can be characterized by the equation: 

dtu + x)di^u + c{u — u) = 0, u(r, x) = g{x). 

At the default event, in the case of zero recovery, we assume that u is given by 


u = u , 

where x~ := max(0, —x). Indeed, if the value of u is positive, meaning that u should be 
paid by the counterparty, nothing will be received by B after the default. If the value of u is 
negative, meaning that u should be received by the counterparty, B will pay u in the case of 
default of C. 

Remark 4.2 The funding value adjustment (FVA) corresponds to a similar nonlinear equa¬ 
tion. 

By the following change of variable 

u{t,x)^^^ = a:). 


19 


the function satisfies the HJB equation: 

+ ^a^{t,x)dxxU^^^ +c(u*^''®)“ = 0, u^^^{T,x) = g{x). (4-5) 

The stochastic representation is: 

a:) = sup eTc” g{XT)\Xt = x], with A:=[0, c], 
aeA L J 


Dual Bound 

We are interested in deriving an efficient upper bound for Xq). Denoting Rt = e^o 

our dual expression is 


m“-^“(0,Xo) = lim inf E 

h —^0 ip^lA 


sup {7??,5 (Xt) - r Rtip{t,Xt)dXt} 
Jo 


< lim E 

h —^0 


sup {RrgiXT) - [ 
o-^T^h Jo 


Rl^*{t,Xt)dXt} , 

where tp* is a fixed strategy. Rewriting the integral in Stratonovich form, we have 

[ Rtp\t,Xt)dXt 
Jo 

= r Rtp*{t,Xt)odXt-]- r RtdxP*{t,Xt)a\t,Xt)dt 

Jo 4 Jq 

Therefore, using the classical Zakai approximation of the Stratonovich integral, it follows that 


E 


= lim E 

n —^oo 


= lim E 


< lim E 

n —>-oo 


sup iR^giXr)- [ Rtp*{t, Xt)dXt] 

sup \ RTgiXr) - [ Rtp*{t,Xt)odXt +1- f RtdxP*{t,Xt)a^{t,Xt)dt\ 

le-Dh ''Jo 4 Jo J 

- ^dxp*{t,xry{t,xn)dt 


sup 

aeVh 




sup {R^g{X^) - [ Rt(p*{t,xn<x{t,xnwr - id.^*(t,xr)a"(t,xr))dt 
ae® '' Jo ^ ^ ' 


where X> := {o : [0, T] —>■ R| a is measurable, and 0 < at < c for all t € [0,r]}. For almost 
every ui we may consider for all n the following deterministic optimization problem. Set 

a^At) = -AitAT iAMtATiAWr A) , 

/3.,„ it) = ^dxP*{t,XT {Lo))A{t,XT{A), 

and consider the function: 

HJ 




(t) = sup I + f ^(a,...„(s)+/3„,„ (s))ds|. 


Note that is the solution of the (path-wise) Hamilton-Jacobi equation: 

(f) + c (t)) + au,,n{t) + A.uit) ^ 0, A^n{T) ^ gu,,n. 


20 














The ODE for can be solved analytically. Fix a, G [0, T], and let 
t* = sup |s < < o| V 0. 

For all t £ [t*, to] we get the following recurrence equation: 

= 

= 

Finally, we observe that, 

u“''®(0,Xo) < lim 

n—^oo '• •' 

We illustrate the quality of our bounds by the following numerical example. 

Remark 4.3 This example falls into the framework of ^ and m- By virtue of their (eon- 
tinuous) pathwise analysis the upper bounds derived above could in the limit be replaced with 
equalities. Only the error introduced by the choice of </5* remains. 

Numerical example 

We take a(t,x) = 1, T = 1 year, Xo = 0. g{x) = x. We use two choices: (p*(t,x) = 

(which corresponds to dxU^^^ at the first-order near c = 0) and (p*{t,x) = 0. We have 
computed E[u 2_'^„(0)] as a function of the time discretization (see Table [T] and El). The exact 
value has been computed using a one-dimensional PDE solver (see column PDE). We have 
used different values of c corresponding to a probability of default at T equal to (1 — 

The approximation has two separate sources of error. First, there is the suboptimal choice 
of the minimizer ip* for the discretized optimization implying an upper bias. The second 
error arises from the discretization of the deterministic optimization problems, which could 
underestimates the true value of the optimization. The choice p* = in our example - 

as expected - is close to be optimal, so the errors arising from the discretization dominate. In 
the contrary, the choice ip* = 0 is far from being optimal, so the numerical results are much 
bigger than the value function. 


e ^\aui,n{s) + I3ui,n{s))ds + U^^„{t°)e''^^ < 0 


-y (aui,nis) +/3u:,n{s))ds + 


HJ /.0\ ... n 


Qu},r\ 


c , (1 - 

PDE 

1/2 

1/4 

1/8 

1/12 

1/24 

1/50 

1/100 

1/200 

0.01 (1%) 

0.26 

0.23 

0.25 

0.26 

0.26 

0.26 

0.26 

0.26 

0.26 

0.05 (4.9%) 

1.29 

1.14 

1.22 

1.26 

1.27 

1.28 

1.29 

1.29 

1.29 

0.1 (9.5%) 

2.52 

2.24 

2.39 

2.46 

2.48 

2.51 

2.52 

2.52 

2.52 

0.7 (50.3%) 

13.60(0) 

12.63(1) 

13.25(2) 

13.53(5) 

13.61(7) 

13.71(18) 

13.75(44) 

13.77(112) 

13.77 


Table 1: The numerical results of E[m^'{j( 0)] with the different time steps when p*{t, x) = 

The numbers in the brackets indicate the CPU times (Intel Core 2.60CHz) in seconds for the case 
c = 0.7 with N = 8192 Monte-Carlo paths. 


21 

















c , (1 - e-^^) 

PDE 

EKi(O)] 

0.01 (1%) 

0.26 

0.40 

0.05 (4.9%) 

1.30 

1.95 

0.1 (9.5%) 

2.53 

3.80 

0.7 (50.3%) 

13.60 

20.08 


Table 2: The numerical results of E[m^{j( 0)] when ip*{t,x) = 0. 
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